RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/945, 182 



DATE: 01/11/2002 
TIME: 22:33:09 



INPUT SET: S36718.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



General Information: 



(i) APPLICANT: Celeste, Anthony J.. 



ENTERED 



Wozney, John 
Rosen, Vicki A. * 
Wolf man, Neil 



Thomseh, Gerald H. 
Melton, Douglas A. 



(ii) TITLE OF INVENTION: TENDON- INDUCING COMPOSITIONS 
(iii) NUMBER OF SEQUENCES: 35' 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: GENETICS INSTITUTE, INC. 

(B) STREET: 87 Cambridge Park Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #l!o, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

0& APPLICATION NUMBER: 09/945,182 
(Si FILING DATE: 



^(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/808, 324 

(B) FILING DATE: 



viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lazar, Steven R. 

(B) REGISTRATION NUMBER: 32,618 

(C) REFERENCE/DOCKET NUMBER: 5202-D 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 617 498-8260 




♦ ft 

PAGE: 2 RAW SEQUENCE LISTING DATE: 01/1 1/2002 

PATENT APPLICATION US/09/945,182 TIME: 22:33:09 

INPUT SET: S36718.raw 

47 (B) TELEFAX: 617 876-5851 

48 

49 

50 (2) INFORMATION- FOR SEQ ID NO:l: 
51 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 926 base pairs 

54 (B) TYPE: nucleic acid 

55 (C) STRANDEDNESS : single 

56 (D) TOPOLOGY: linear 
57 

58 (ii) MOLECULE TYPE: DNA (genomic) 

59 

60 (vi) ORIGINAL SOURCE: 

61 (A) ORGANISM: Homo sapiens 
62 

63 (vii) IMMEDIATE SOURCE: 

64. (B) CLONE: vl-1 

65 

66 (ix) FEATURE: 

67 (A) NAME /KEY : mat_peptide 

68 (B) LOCATION: 571. .882 
69 

70 (ix) FEATURE: 

71 (A) NAME/KEY: CDS 

72 (B) LOCATION: 1..882 
73 

74 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

76 

77 GCG CGT AAT ACG ACT CAC TAT AGG GCG AAT TGG GTA CGG GGC CCA GGC 48 

78 Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 

79 -190 -185 -180 -175 
80 

81 AGC TGG ACT TCT CCG CCG TTG CTG CTG CTG TCC ACG TGC CCG GGC GCC 96 

82 Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 

83 -170 -165 -160 
84 

85 GCC CGA GCG CCA CGC CTG CTG TAC TCG CGG GCA GCT GAG CCC CTA GTC 144 

86 Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 

87 -155 -150 -145 
88 

8 9 GGT CAG CGC TGG GAG GCG TTC GAC GTG GCG GAC GCC ATG AGG CGC CAC 192 

90 Gly Gin Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His 

91 -140 -135 -130 
92 

93 CGT CGT GAA CCG CGC CCC CCC CGC GCG TTC TGC CTC TTG CTG CGC GCA 240 

94 Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 

95 -125 -120 -115 
96 

97 GTG GCA GGC CCG GTG CCG AGC CCG TTG GCA CTG CGG CGA CTG GGC TTC 288 

98 Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 

99 -110 -105 -100 -95 



§ 



PAGE: 3 RAW SEQUENCE LISTING DATE: 01/11/2002 

PATENT APPLICATION US/09/945,182 TIME: 22:33:10 

INPUT SET: S36718.raw 

100 

101 GGC TGG CCG GGC GGA GGG GGC TCT GCG GCA GAG GAG CGC GCG GTG CTA 3 36 

102 Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 

103 -90 -85 -80 
104 

105 GTC GTC TCC TCC CGC ACG CAG AGG AAA GAG AGC TTA TTC CGG GAG ATC 3 84 

106 Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie 

107 -75 -70 -65 
108 

109 CGC GCC CAG GCC CGC GCG CTC GGG GCC GCT CTG GCC TCA GAG CCG CTG 432 

110 Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 

111 -60 -55 -50 
112 

113 CCC GAC CCA GGA ACC GGC ACC GCG TCG CCA AGG GCA GTC ATT GGC GGC 480 

114 Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly 

115 -45 -40 -35 
116 

117 CGC AGA CGG AGG AGG ACG GCG TTG GCC GGG ACG CGG ACA GCG CAG GGC 528 

118 Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 

119 -30 -25 -20 -15 
120 

121 AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC AGG GGC CGG AGC CGC 576 

122 Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 

123 -10 -5 1 
124 

125 TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG GAG CTC GGC TGG GAC 624 

126 Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 

127 5 10 15 
128 

12 9 GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG TAC CAC TGC GAG GGC 672 

13 0 Asp Trp lie lie Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly 
131 20 25 30 

132 

133 CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC AAC CAT GCC 720 

134 Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 

135 35 40 ' 45 50 
136 

13 7 ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA GAC GCG GCG CCG GCC 768 
13 8 lie lie Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 
139 55 60 65 

140 

141 TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC AGC ATC CTC TAC ATC 816 

142 Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser lie I^eu Tyr lie 

143 70 75 80 
144 

145 GAC GCC GCC AAC AAC GTT GTC TAC AAG CAA TAC GAG GAC ATG GTG GTG 864 

146 Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 

147 85 90 95 
148 

149 GAG GCC TGC GGC TGC AGG TAGCGCGCGG GCCGGGGAGG GGGCAGCCAC 912 

150 Glu Ala Cys Gly Cys Arg 

151 100 
152 



PAGE: 4 RAW SEQUENCE LISTING DATE: 01/1 1/2002 

PATENT APPLICATION US/09/945,182 TIME: 22:33:10 

INPUT SET: S36718.raw 

153 GCGGCCGAGG ATCC 926 

154 

155 

156 (2) INFORMATION FOR SEQ ID NO : 2 : 
157 

158 (i) SEQUENCE CHARACTERISTICS: 

159 (A) LENGTH: 294 amino acids 

160 (B) TYPE: amino acid 

161 (D) TOPOLOGY: linear 
162 

163 (ii) MOLECULE TYPE: protein 

164 

165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

166 

167 Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 

168 -190 -185 -180 -175 
169 

170 Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 

171 -170 -165 -160 
172 

173 Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 

174 -155 , -150 -145 
175 

176 Gly Gin Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His 

177 -140 -135 -130 
178 

179 Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 

180 -125 -120 -115 
181 

182 Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 

183 -110 -105 -100 -95 
184 

185 Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 

186 -90 -85 -80 
187 

188 Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie 

189 -75 -70 -65 
190 

191 Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 

192 -60 -55 -50 
193 

194 Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly 

195 -45 -40 -35 
196 

197 Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 

198 -30 -25 -20 -15 
199 

200 Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 

201 -10 -5 1 
202 

203 Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 

204 5 10 15 
205 



* 



PAGE: 5 RAW SEQUENCE LISTING DATE: 01/1 1/2002 

PATENT APPLICATION US/09/945,182 TIME: 22:33:10 

INPUT SET: S36718.raw 

206 Asp Trp lie lie Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly 

207 20 25 30 
208 

209 Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 

210 35 40 45 50 
211 

212 lie lie Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 

213 55 60 65 
214 

215 Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser lie Leu Tyr lie 

216 70 75 80 
217 

218 Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 

219 85 90 95 
220 

221 Glu Ala Cys Gly Cys Arg 

222 100 
223 

224 (2) INFORMATION FOR SEQ ID NO: 3: 
225 

226 (i) SEQUENCE CHARACTERISTICS: 

227 (A) LENGTH: 1207 base pairs 

228 (B) TYPE: nucleic acid 

229 (C) STRANDEDNESS : single 
23 0 (D) TOPOLOGY: linear 
231 

232 (ii) MOLECULE TYPE: DNA (genomic) 

233 

234 (vi) ORIGINAL SOURCE: 

23 5 (A) ORGANISM: Homo sapiens 
236 

237 (vii) IMMEDIATE SOURCE: 

238 (B) CLONE: MP52 
239 

24 0 (ix) FEATURE: 

241 (A) NAME/KEY: CDS 

242 (B) LOCATION: 845.. 1204 
243 

?. 244 

245 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

246 

247 ACCGGGCGGC CCTGAACCCA AGCCAGGACA CCCTCCCCAA ACAAGGCAGG CTACAGCCCG 60 
248 

24 9 GACTGTGACC CCAAAAGGAC AGCTTCCCGG AGGCAAGGCA CCCCCAAAAG CAGGATCTGT 120 
250 

251 CCCCAGCTCC TTCCTGCTGA AGAAGGCCAG GGAGCCCGGG CCCCCACGAG AGCCCAAGGA 180 
252 

253 GCCGTTTCGC CCACCCCCCA TCACACCCCA CGAGTACATG CTCTCGCTGT ACAGGACGCT 240 
254 

255 GTCCGATGCT GACAGAAAGG GAGGCAACAG CAGCGTGAAG TTGGAGGCTG GCCTGGCCAA 3 00 

256 

257 CACCATCACC AGCTTTATTG ACAAAGGGCA AGATGACCGA GGTCCCGTGG TCAGGAAGCA 360 



SEQUENCE VERIFICATION REPORT 
PATENT APPLICATION US/09/945,182 



DATE: 01/11/2002 
TIME: 22:33:11 



INPUT SET: S36718.raw 



Original Text 



SEQUENCE MISSING ITEM REPORT 
PATENT APPLICATION US/09/945,182 



DATE: 01/11/2002 
TIME: 22:33:11 



INPUT SET: S36718.raw 

< < THERE ARE NO ITEMS MISSING > > 



PAGE: 1 SEQUENCE CORRECTION REPORT DATE: 01/11/2002 

PATENT APPLICATION US/09/945,182 TIME: 22:33:11 

INPUT SET: S36718.raw 



Line 
1945 



Original Text 

(C) INDIVIDUAL ISOLATE: primer number 8 



Corrected Text 

(C) INDIVIDUAL ISOLATE: primer number 8 



